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Abstract 

Most communication systems use some form of feedback, often related to channel state information. In this paper, we study 
diversity multiplexing tradeoff for both FDD and TDD systems, when both receiver and transmitter knowledge about the channel 
is noisy and potentially mismatched. For FDD systems, we first extend the achievable tradeoff region for 1.5 rounds of message 
passing to get higher diversity compared to the best known scheme, in the regime of higher inultiplexing gains. We then break the 
mold of all current channel state based protocols by using multiple rounds of conferencing to extract more bits about the actual 
channel. This iterative refinement of the channel increases the diversity order with every round of communication. The protocols 
are on-demand in nature, using high powers for training and feedback only when the channel is in poor states. The key result is 
that the diversity multiplexing tradeoff with perfect training and K levels of perfect feedback can be achieved, even when there 
are errors in training the receiver and errors in the feedback link, with a multi-round protocol which has K rounds of training 
and K —1 rounds of binary feedback. The above result can be viewed as a generalization of Zheng and Tse, and Aggarwal and 
Sabharwal, where the result was shown to hold forK=\ and K = 2 respectively. For TDD systems, we also develop new 
achievable strategies with multiple rounds of communication between the transmitter and the receiver, which use the reciprocity of 
the forward and the feedback channel. The multi-round TDD protocol achieves a diversity-multiplexing tradeoff which uniformly 
dominates its FDD counterparts, where no channel reciprocity is available. 
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I. Introduction 

Consider the feedback system shown in Figure 1, where the receiver first measures the channel and sends a feedback signal 
to the transmitter about the channel. The transmitter uses the channel information about the channel conditions to adapt its 
transmission strategy. A common feedback model involves quantizing the channel knowledge at the receiver into a finite 
number of bits. The finite level adaptation has been extensively studied for different adaptive transmission schemes like power 
control [1-18], rate control [8], beamforming [19,20] and codebook adaptation [21]. A bulk of the work to-date assumes that 
either the channel knowledge at the receiver is perfect and/or the feedback channel is noise-free. The above body of work 
thus serves as an upper bound to the performance of a system, where channel information at the transmitter and receiver is 
approximate and potentially mismatched. 
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Fig. 1. A feedback based adaptive transmission system. 

Imperfect channel knowledge at the transmitter and receiver is unavoidable in wireless communications since the channel 
H (see Figure 1) is time-varying and hence has to be measured periodically by the receiver to assist in transmitter adaptation. 
To measure the channel, the transmitter sends a periodic finite length training signal, which is used at the receiver to form an 
estimate of the channel. The channel estimate is then quantized by the receiver and sent over a feedback channel which often 
is also a wireless link, and is thus prone to errors. In this scheme, a fundamental question is how much information about the 
channel can be extracted, which both the transmitter and receiver can agree upon. In short, the transmitter and receiver have 
to consent about the state of the channel by conversing over a noisy fading channel. In this paper, we make progress towiirds 
the above question for the case of power control, with the objective to minimize the outage probability. 

If the channel information is known perfectly at the transmitter and the receiver, then the transmitter can perform power 
control to invert the effect of multiplicative fading channel. The inversion conserves power in good channel states by using 
less transmit power, while using more transmit power in poor channel conditions. If the transmitter knowledge is limited to 
a finite-bit approximation of the perfect receiver knowledge, then the quantized power control [2, 8, 12] is an approximation 
of the ideal power inversion. Much like the ideal channel inversion, the quantized power control delivers an average received 
signal-to-noise ratio of SNR by exploiting the fact that poor channel states are very rare and hence large peak powers are also 
used rarely. 

For FDD systems, consider that the receiver quantizes the channel state and sends a feedback index consisting of K levels. It 
was shown in [8, 12] that the diversity increases exponentially in K for mn > 1 and linearly in K for mn = 1. However, in [6], 
we show that if the receiver obtains the channel state information by training and the feedback from the receiver is sent over a 
noisy channel, feedback bits help in resolving the channel state till the noise floor becomes dominant. If the channel estimate 
is below the noise floor, the channel estimate does not have sufficient resolution to extract additional bits of information about 
the channel. This restricts the increase in diversity with feedback bits. We further showed that diversity equivalent of 1-bit 
perfect feedback and perfect channel state at the receiver can be achieved with 1 bit of noisy feedback with two rounds of 
training at the receiver. In this paper, we extend FDD results in two directions. First, we extend the achievability for K bits of 
feedback showing gains for r > min(m, n) — 1. In this regime, more bits of feedback help increase the diversity because more 
than one bit about the channel can be resolved if it is above noise floor, which can then be used for finer power control at the 
transmitter for higher multiplexing gains. Thus, more than one bit about the channel can be resolved using the first training. 

Second, we extend the results in [6] to a quantized iterative protocol where we use K rounds of training with K —1 rounds 
of feedback to achieve the same diversity as K levels of perfect feedback when the receiver knows perfect channel state 
information. The protocol uses K rounds of training and K — 1 rounds of feedback to extract \og2{K) bits about the channel. 
The main idea in the proposed multi-round protocol is that the transmitter opportunistically sends higher training power when 
the previous round of training indicated that the channel is below the noise floor The power used by the nodes to send a 
training/feedback/data symbol in a channel event is governed by the probability of the channel event itself, so that weighted 
average of the transmit power at the nodes meet the average power constraint. The above on-demand use of powers over many 
rounds helps increase the diversity beyond single shot estimation methods considered in all prior work. This is because the 
channel estimates at the nodes can be refined with increasing rounds of communication leading to larger diversities. 
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For TDD systems, the reciprocity in the forward and the backward channel can be utilized to achieve the gains in diversity. 
Due to reciprocity, if any of the nodes have perfect channel state information, the diversity gain is unbounded. However, if none 
of the nodes know channel state, reciprocity allows the receiver to detect if the transmitter has made an error in understanding 
the previous feedback signal. Thus, the receiver can correct transmitter's actions more rapidly compared to the case of FDD 
protocols, where such inomediate error detection at receiver is not possible. The proposed protocol is able to achieve better 
diversity than the FDD results with 1.5 rounds. More precisely, this scheme achieves a diversity of mn{mn + l) — {m + n—l)r 
for multiplexing < r < inin(m, n). For a SIMO/MISO system, this strategy achieves the same diversity-multiplexing 
tradeoff as in the FDD system as the number of feedback levels go to oo. The knowledge at the nodes can be further 
refined by more rounds of training and feedback. In general, a {K — 1).5 round strategy is able to achieve a diversity of 

mn(l + mn + 1- {mn)^~^) — {mn)^~'^{m + n — l)r for multiplexing < r < min(TO, n). The FDD and TDD iterative 

schemes have the same diversity in the Umit as r ^ 0, but the TDD scheme leads to higher gains at aU non-zero multiplexing 
gains. 

The use of multiple rounds is similar in spirit to the improvement in error exponents by using the SchaUcwijk-Kailath-like 
feedback-based coding scheme [22-24] for feedback in Gaussian channels. The multi-round strategies involve zooming into the 
region where there is some uncertainty. In the Schalkwijk-Kailath coding scheme, the transmitter starts with a coarse version 
of the message and then refines it with more rounds of feedback. In our scheme, the channel is unknown and the feedback is 
used to resolve the channel at the transmitter and the receiver. We assume the forward and the backward channels as fading 
channels, but the nodes are not interested in both these channel gains. Thus only partial information is needed to decide upon 
the feedback and the power levels. In our scheme, the receiver learns the channel state information which is a continuous 
random variable unhke learning a discrete message in [22]. Moreover in our scheme, the feedback channel has noise and is 
quantized in the case of FDD protocols unlike [22] where the feedback is a real number received noiselessly at the transmitter. 
Hence the basic idea of successive refinement in the two schemes is similar, but the schemes differ significantly. 

Multiple round protocols are also used in ARQ systems [13]. In [13], the receiver sends an acknowledgement confirming if 
the data can be decoded in each round. However, it was assumed that the receiver knows the channel state information perfectly 
and the feedback is perfect. In this paper, the receiver obtains the channel state information by training and the feedback is 
sent over a noisy fading channel. In this paper, we consider the achievable diversity multiplexing tradeoff when the transmitter 
and receiver "conference" about the channel, which allows the transmitter to perform a more precise power control and achieve 
better performance. The conferencing assumes no Genie-knowledge at any point, and can be viewed as a consensus problem 
over noisy channels. 

The rest of the paper is organized as follows. We formulate the problem in Section II. Section HI and IV describes the 
known and the new results for the FDD and TDD systems respectively. Section V concludes the paper. 



II. Problem Formulation 

A. Two-way Channel Model 

Consider a multiple input-output channel with the transmitting node denoted by T and receiving node denoted by R. We 
will assimie that there are m transmit antennas at the source node and n receive antennas at the destination node, such that 
the input-output relation is given by 

T ^R:Y = HX + W, (1) 

where the elements of H and W are assumed to be i.i.d. with complex normal distribution of zero mean and unit variance, 
CN (0,1). The matrices Y, H, X and W are of dimension n x T^oh, n x m,m x Tcoh and n x T^oh, respectively. We assume 
that Tcoh is coherence interval such that the channel H is fixed during a fading block of Tcoh consecutive channel uses, and 
statistically independent from one block to another. We further assume that Tcoh is finite and do not scale with SNR. The 
transmitter is assumed to be power-limited, such that the long-term power is upper bounded, i.e, ;p^trace(E [XX^^]) < SNR. 

We consider both frequency-division and time-division duplex models for the feedback channel. In both cases, we assume 
that the same multiple antennas at the transmitter and receiver are available to send feedback, in a half-duplex manner. For the 
feedback path, the receiver will act as a transmitter and the transmitter as a receiver. As a result, the feedback source (which is 
destination for data bits) will have n transmit antennas and feedback destination (which is source of data bits) will be assumed 
to have m receive antennas. Furthermore, a block fading channel model is assumed for the feedback channel, 

R ^ T : y> = HfXf + Wf, (2) 

where Hf is the MIMO fading channel for the feedback link, and the Wf is the additive noise at the receiver of the feedback; 
both are assumed to have i.i.d. CiV(0, 1) elements. The feedback transmissions are also assumed to be power-limited with 
a long-term power constraint given by j^trace(E XfX^^ ) < SNR/. Without loss of generahty, we will assume the case 
where the transmitter and receiver have symmetric resources, such that SNR = SNR/. All our results can be easily generaUzed 
to the case of asymmetric resource usage. 
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For the case of FDD systems, H and Hf aie statistically independent. On the other hand, for the case of TDD, we assume 
that the H and Hf are perfectly correlated within one coherence interval, and adopt a phase-symmetric two-way channel model 
with Hf = H'^ [4,5]. 
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Fig. 2. Two-way fading channel, wliere the forward and feedback channels use the same antennas. 



B. Multi-round Protocols to Acquire Channel State 

The two-way channel model allows the transmitter and receiver to conduct multiple rounds of information exchange. In this 
paper, we will consider coding strategies where the transmitter-receiver perform a multi-round exchange to estimate the channel 
H, before sending the data in order to maximize system diversity order A round is defined to consist of two transactions: a 
transmission sent from node T to node R and a return transmission from node R to node T. We also define half-round, where 
only one node sends a transmission without receiving a transmission in exchange. 

Figure 3 depicts the input-output signals and the temporal dependence between them. In round i, transmission from 
node T is denoted as Xi (^X^^^ ,Yj~^^ where X'^^ — {Xi,X2, ■ ■ ■ ,Xi^i\ are all the previous inputs and Y^^^ = 
{1/1, Y/ 2, • ■ • i^/.i-i} are all the previous outputs of the feedback channel. Analogously, we define feedback channel input 
Xf i (^Xy'^ ,Y^^ . Since both T and R are assumed to be half-duplex and the multi-round protocol is always initiated by the 
transmitter T, the signal Xf ^ from node R can depend on the last round of received signal Yi. 
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Fig. 3. Multi-round protocols: (a) Input-output relations and (b) temporal dependence of signals. 



C. Diversity-Multiplexing Tradeoff 

In this paper, we will focus on the asymptotic regime of large SNR. Thus, we adopt the notation of [25] to denote = 
to represent exponential equality as a = 6 if limsNR^oo log(a)/ log(SNR) = limsNR^cc log(6)/ log(SNR). However, a = 
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would mean that a decays faster than any polynomial in SNR. In this paper, we are only concerned with probability decaying 
polynomially with SNR and hence the probabihty of events that decay faster than polynomial decay (example, exponential 
decay) will not enter the calculations. We similarly use <, >, <, > to denote exponential inequaUties. 

We will only consider the case when the codeword X spans a single fading block. Based on the transmitter channel knowledge 
G, the transmitted codeword is chosen from the codebook Cq = {Xg{^)-iXg{2),- ■ ■ , X(3(2^^«°'>)}, where R is the rate of 
the codebook. All XG(fc)'s are matrices of size m x Tcoh- In this paper, we will only consider single rate transmission where 
the rate of the codebooks does not depend on the transmitter knowledge, and the codebooks are derived from the same base 
codebook C by power scaling of the codewords. In other words Cq = y/PoC, where the product implies that each element 
of every codeword is multiplied by y/Pc where each codeword X{k) e C has unit power. Thus, Pg is the power of the 
transmitted codewords. Recall that there is an average transmit power constraint, such that ]E(Pg) < SNR. AH coding strategies 
in this paper assume Gaussian input distribution. 

Since our focus is on the delay-limited regime of single codewords, we will use outage as our metric. Outage is defined as 
the event when the mutual information of the channel given instantaneous realization of the estimate Hr of the channel H 
at the receiver, I{X; Y\Hr) is less than the desired rate R. Let n(0) denote the probability of outage, where O is the set 
of all the channels where the transmitted rate R is less than the maximum supportable rate I{X; Y\Hk). The system is said 
to have diversity order of d if 11(0) = SNR"*^. Note that all the index mappings, codebooks, rates, powers are dependent on 
the average signal to noise ration, SNR. Specifically, the dependence of rate i? on SNR is explicitly given by i? = r log SNR, 
where r is labeled as the multiplexing gain. The diversity-multiplexing tradeoff is then described as the maximum diversity 
order d{r) that can be achieved for a given multiplexing gain r. 

If Hfi = H, I{X;Y\H) = logdet (H- ^HQH^) is the mutual information of a point-to-point Unk with m transmit 
and n receive antennas, transmit signal to noise ratio Pq and input distribution Gaussian with covariance matrix Q [25]. The 
dependence of the index at the transmitter is made explicit by writing the transmit SNR as a function of transmitter channel 
knowledge G. If Hr ^ H, we will consider an achievable strategy considering the estimation error as noise as in [26] and 
hence 2^(-^;'^l^'^)> det (lA PgHrH^'' ^^261 

For the case of perfect receiver information and no transmitter information, let the codebooks of rate R = r log SNR and 
power P = SNR^ are used. The outage event is defined as 0{R, P) = {H : (l + ^HQW) < R}, the probabihty of which 
is Il{0{R,P)) = SNR"'^^'''*') thus giving the diversity-multiplexing tradeoff of d = G{r,p) [11], where 

min(m,n) 

G{r,p) = inf (2i — 1 -|- max(m, n) — min(m, n)) a^, 

and 

min(m,n) 

A{r,p) = {a™"'"'"Vi > • • • amin(m,n) > 0, ^ {p- ai)+ < r}. 

i=0 

The function G{r,p) defines a piecewise Unear curve connecting the points {r,G{r,p))= {kp,p{m — k){n — k)), k = 
0, 1, . . . , min(m, n) for fixed m, n and p > 0. We define Gu{r) recursively as follows. Let Go(r") = 0, and G„(r) = 

G{r,\ + Gu-i{r)) for ?i > 1. 

Remark 1. Consider a MIMO system with m transmit and n receive antennas with perfect channel state information at the 
receiver. The diversity multiplexing tradeoff with no CSIT is dcsiR = G{r, 1) = Gi(r) [25]. Further, the diversity multiplexing 
tradeoff of rfcsiRT^ = GK{r) can be achieved with K levels (or equivalently h = log2(-f^) bits) of quantized feedback from 
the receiver about the channel [8]. 

III. FDD Protocols: Iterative Quantization 
In this section, we extend our 1.5 round protocol of [6] in two directions. First, we show that for non-zero multiplexing 
gains, more than 1-bit of information about the channel can in fact be extracted and sent reliably across a noisy feedback 
channel. Furthermore, the number of bits of information which can be extracted depends on the multiplexing gain. Thus a 
higher diversity order can be achieved than the proposed techniques in [6] for r > 0. Second, we construct a multi-round 
protocol, which can extract more bits about the channel at all multiplexing gains, including zero multiplexing point. The 
protocol iteratively refines the channel information at both ends, and also keeps the two nodes aligned in their knowledge 
about the channel estimate. 

A. Prior Results: 1-bit about the channel in 1.5 rounds 

The diversity-multiplexing tradeoff for the case of receiver with perfect information and noiseless quantized information has 
been extensively studied in [2,8,9]. In this subsection, we will review the results of [6] where imperfect training and noisy 
feedback was considered for FDD systems, described in the new terminology of Section II. 
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1) Round 1 (forward): T sends a training at power SNR, Xi = VSNR /3, where f3 is known fixed sequence. The signal 
received at R is Yi = HXi + W, which is used to form an MMSE estimate Hi. ^ 

Round 1 (reverse): The feedback from R is a 1-bit quantization of the channel estimate qi : Hi {0, 1}, of the 
estimated channel such that 

qi{Hi) = i . . . (3) 

I 1 , otherwise 

where Oi = {Hi : det(/ + ^i^^JSNR) < SNR''+'} The quantized bit is sent as follows: 

fVsn^0j, qi{Hi)^o 

where /3/ is a known sequence. The signal received at node T is = HfXf^i + Wf. 

2) Round 2 (forward only): Node T decodes the bit of information about the channel from Yf^i, labeled as qi (found by 
received power estimate with thresholds of SNR*/™" for some 6 « e), and forms a power controlled signal 

where x = [0 c] , where c is the data codeword concatenated to the training signal /3. 

Theorem 1 ([6]). For e < r < m — e, the above strategy achieves the diversity multiplexing tradeoff of di, 5 ^2{r) = G{r, 1 + 
G{r, 1)) by choosing e arbitrarily close to 0. 

Remark 2. Note that the signals in the protocol depend on the last channel output; in particular Xf^i{Yi) and X^iYf^i). In 
the multi-round protocol discussed in Section ni-C, we will consider the generalized feedback structures, which will depend 
on all previous channel outputs, as described in Section II-B. 

Remark 3. Unlike the previous work in [4,26], we do not account for the resources spent in channel training and feedback 
in this paper. Resource accounting can be performed using the procedure developed in [4], by scahng the multiplexing r 
appropriately. More precisely, the multiplexing r should be replaced by rT/{T — 2m— 1). The resource accounting multiplier 
assumes that the feedback requires one channel use and the training requires m channel uses. Further, the number of antennas 
would need to be optimized for each multiplexing gain, as in [4,26]. 

A quick note on the notation dp^K{r) for diversity-multiplexing tradeoff: the first subscript p refers to the number of rounds 
in the protocol and the second subscript K refers to the number of levels communicated by the receiver in each round. All 
FDD protocols discussed in this paper rely on receiver sending quantized information about the measured channel back to the 
transmitter. In the TDD protocols, the channel syrmnetry will be exploited and transmitter will also convey channel information 
to the receiver. Further in TDD protocols, a power-controlled training symbol may be fed back from the receiver which takes m 
channel uses and is equivalent to sending soft information about the channel, instead of quantized bits like in FDD protocols. 
Thus, the second subscript K will be suppressed and dp{r) will be used to denote diversity order of TDD protocols. 
Example 1 (1.5 round protocol, 1 bit feedback): Consider a SISO system where the receiver now does not know the value of 
channel estimate H, but only an estimate Hi. The above strategy reduces to the receiver deciding if H G Oi, where 

di = {^1 : log (1 + I^iI^SNr) < i?} . (6) 

Let a be the negative SNR exponent of while a be the negative SNR exponent of Note that Oi represent the 

event a > 1 — r. Since a = a with probability 1 when a < 1 (For more details, the reader is referred to Appendix A-1.), 
this decision from the receiver is reliable. Further since U{di) = SNR-^i-"), the receiver uses a power level of SNR^-'' to 
communicate Oi, while SNR" to communicate the possibihty of Of. This is a special case of using power-controlled feedback; 
the reader is referred to Appendix A-2. If the transmitter gets Oi, it transmits with power SNR^"'^, else uses SNR. Hence, 
the two error events are that the transmitter made an error in decoding which happens with probability SNR~^^^ while the 
second is that the power of SNR^"'' is not sufficient which happens with probabihty SNR~^^^~''\ Thus, diversity of 2(1 — r) 
can be achieved. ■ 



B. First Extension: log2(-ftr) bits about the channel in 1.5 rounds 

In this subsection, we will extend the results of Theorem 1 to more than 1 bit of feedback for non-zero multiplexing gain. 
The key observation is that while no more than 1-bit can be extracted for zero multiplexing gain [6], the channel estimate 
Hi has enough resolution to derive more bits about for non-zero multiplexing gains, r > 0. The modified protocol can be 
described as follows. 
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1) Round 1 (forward): T sends a training at power SNR, Xi = \/SNR/J, which is used by the receiver to form an MMSE 
estimate Hi. 
Round 1 (reverse): Let 

Sk-I = {Hi : dct(/ + i^^ijtsNRl+min(cM,GK-2(r+<:))) ^ SNR''+^} , (7) 

where cm = { ^~m^^ )~^' where the notation denotes max(a;, 0). Further, for any u e [0,K — 2], let 0„ be defined 



as 



du = d'k-i n I n ^Jj n {^i • det(7+ Fi^|SNR1+™('="'^''(''+^») > SNR''+^} . (8) 

The feedback from R is a level quantization, qi : Hi {0,1, . . . , K — 1} computed as follows, 

qi{Hi) = u if Hi e du. (9) 
Let Pu = min(G„(r + e), cm)- The quantized signal qi{Hi) is sent as 



fVsN^^^^^^^^+^f, qi{Hi)=u<K-l 
""^■^ ^"^^^ - WsNR^+«(^+-^+--) /3„ qiiHi)=K-l ■ ^''^ 

The signal received at node T is = HfXf^i + Wf. 
2) Round 2 (forward only): Node T decodes the K level information about the channel from Yj i, labeled as (found by 
received power estimate with thresholds of SNR'^'^'*^^'^"^'^ +s/mn some 5 « e), and forms a power controlled 
signal 

ry^^^iTo^^^ $-i = u<i^-i 

\ ^sNRi+-'"(^--('-+^)'^('^+^'i+^")) X, qi = K-l ' 
where x = [(3 c] , where /3 is known training signal as above and c is the data codeword. 

Theorem 2. For e < r < ni — e, the above strategy achieves the diversity multiplexing tradeoff of di,5^K{r) = min{G(r, 1 + 

G(r, 1 + Cm)), Gk{i^)} by choosing e arbitrarily close to 0. 

Proof: The proof is provided in Appendix B. ■ 
We first note that for r > m — 1 and K > 2, the diversity order di.5,if (r) achieved in Theorem 2 is greater than the 
diversity order (ii.5.i(^) achieved in Theorem 1. For r > m — 1, a finer power control is possible in the region where the 
channel condition is good, i.e, where the channel estimate dominates the noise floor. In contrast, when r ^ 0, the channel 
is not resolvable beyond one bit of information. The increased channel resolvabihty at higher multiplexing gains is the main 
reason for increased diversity order. The following example makes the above intuition exphcit. 

Example 2 (1.5 rowid protocol, 3 level feedback, r = 1/6): Consider a SISO system where the receiver only has a channel 
estimate. Hi. The above strategy reduces to the receiver deciding if Hi G Oi, where 

02 = {Hi : (l + l^il^SNR^/^) < SNR^/''} (12) 

di = (5^n {^1 : (l + I^iI^SNr) < SNR^/^} (13) 

do = dlr\dl. (14) 

Let a be the negative SNR exponent of while a be the negative SNR exponent of \Hi\'^. Note that Oi represent the 
event 5/6 < a < 1 and O2 represent the event a > 1. Since a = a with probability 1 when q < 1, this decision from 
the receiver is reliable. The three events Oq, Oi and O2 are transmitted from the receiver at powers of SNR^^^, SNR° and 
SNR^ respectively. The power levels used by the transmitter in these three events are SNR, SHR''^ and SNR^ respectively. It 
is easy to see that the power constraints are asymptotically satisfied. Outage occurs when less power is used due to error in 
the feedback or when the power of SNR^ was not sufficient. The event that less power is used due to feedback error happens 
with probability SNR"^^"^^^^"*"^ and the event that power of SNR^ is not sufficient to avoid outage happens with probability 
SNR"^^"^/'^^ Hence, a diversity of 2 - 1/6 = 2 — r can be achieved. Note that this is larger than 2 — 2r which is achieved 
with 1 bit of feedback. The increase with the additional level of the feedback is because a = a (with probability 1) as long as 
S < 1 which gives perfect resolution for a from the estimate as long as S < 1. The first feedback decides whether a > 5/6 
and since there is a gap between 5/6 and 1 in which the channel can be resolved perfectly, the next feedback level resolved 
whether the channel satisfies 5/6 < a < 1. However at multiplexing r — > 0, no more than 1-bit of information could be 
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derived from the estimate since the channel cannot be resolved when S > 1. ■ 
In fact, the number of useful bits which can be derived about the channel can be linked to the multiplexing gain as shown 
in the following corollary. 

Corollary 1 (Bits About the Channel). The maximum number of bits b about the channel, for sake of power control, which 
can be derived from the channel estimate Hi as a function of multiplexing gain r is given as follows. 

1) SISO: log2(-fl' + 2) noiseless bits about the channel can be derived if the multiplexing gain r g (y^^^-, j^qif • 

2) MIMO: For r < m — 1, one noiseless bit can be derived. For m — I < r < m, log(-ftr + 3) bits can be derived, where 
K = maxju : u > 0, Cm > (j„(r)}. 

The final coroUary of this section captures the performance of rfi.s^if (r) as the number of levels increases. We note that due 
to error in channel estimation and the feedback channel, the diversity order rfi.5,i<-(r) does not increase unboundedly with K 

for a given r. 

Corollary 2. As K ^ oo, the above diversity multiplexing tradeoff reduces to G{r, 1 + G(r, 1 + Cm)) for any < r < 
min(TO, n). This is because Gxir) is a monotonic increasing function with K and goes to oo as K ^ oo. 

For the special case of m = 1, cm = r and lim/f^oo dK{r) = + n(l — r). Kim et al. considered a model of imperfect 
channel estimate at the transmitter in [10] where the transmitter knows the correlated channel estimate with a certain correlation 
while the receiver knows the channel state information perfectly. However in our case, the receiver knows a correlated version 
of the channel state which is shared with the transmitter via noisy quantized feedback. Even though the two cases are different, 
in the special case of m = 1, both the models yield the same diversity multiplexing tradeoff. 



C. Iterative Quantization: log2(-ftr) bits in {K — 1).5 Rounds 

In the previous two sections, the protocols used only 1.5 rounds. As a result, for r — » 0, no more than 1-bit about the channel 
could be extracted. In this section, we show how multiple rounds can be used by the transmitter and receiver to iteratively 
refine the knowledge about the channel H beyond one bit for all multiplexing gains, including r = 0. In each round, the 
receiver sends a binary signal back to the transmitter, providing an additional level of channel information. To perform the 
iterative quantization, both transmitter and receiver occasionally and opportunistically use large transmit power in some blocks. 
However, the use of large power is performed very rarely, allowing both nodes to stay within their prescribed average power 
constraints over the long-term. 

For sake of clarity, we will state the multi-round iterative quantization protocol when only binary messages are conveyed 
by the receiver in each round. Thus there is 1-bit quantization by the receiver in each round. All our results can be extended 
to feedback messages belonging to a larger alphabet in each round (Uke in Section III-B). 

1) Round 1 (forward): T sends a training at power SNR, Xi = VSNR (3, where /3 is a known fixed sequence. The signal 
received at R is Yi = HXx + W, which is used to form an MMSE estimate Hi. The feedback from R is a binary 
quantization, qi{Hi) such that 

Jo, ifHiedl 

qi{Hi) = i . , (15) 

II, otherwise 

where Oi = {Hi : det(/ + ifi^jSNR) < SNR^+'}. 

Round 1 (reverse): The quantized channel qi{Hi) is modulated as follows: 

Y (Y^-l-^l^f^ Qi{Hi) = 

{Yi) = < g. (16) 

[O./Sf, qi{Hi) = l 

where /?/ is a known sequence. The signal received at node T is Yf^i = HfXf^i + Wf. The transmitter T estimates qi 

as qi using a MAP detector for power level with a threshold of SNR*^^™". 

At the end of round i — l, the transmitter forms an estimate of the quantization performed at the receiver, which is 
denoted by the function qi-i{-). To state the following recursion, we assume qo = 1. The estimate is used in the 
i*'* round by the transmitter as described below. 

2) Round i G {2, . . . ,K — 1} (forward): The transmitter trains the receiver with X^ = y/PiP where the power level Pi 
chosen as follows 

p. -JO' if*-i=0 

\SNRi+^-('^+^), otherwise ' ^ ' 

The receiver's actions can be described as follows. 
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• If any of qu{Hu,q" ^) = for u < i — 1, then the receiver performs a MAP power estimation to check if the 
transmitter sent a training at power level of or SNR^^*^'"^^'^"'''^^ The associated MAP threshold is SNR*^. If power 
estimate > SIMR', qi{Hi,q'-^) = 0, else q,{Hi,q'-^) = 1. 

• If qu{Hu-q"^^) = 1 for all u < i — 1, the receiver estimates the channel as Hi assuming that the training power is 
SNR^^*^'"^*^'^''''^^ and bases the feedback signal based on the estimate as follows 

(li\tii-,q ) — \ , . : (.io) 

I 1, otherwise 

where = {H, : det(/ + ^,ffJ'SNRi+^"(''+^)) < SNR'^+^j. 

3) Round i G {2, . . . , if — 1} (reverse): The receiver sends the signal 

^^_|VSNRi+«-('-+^)/3/, q,{%,q^-^)=Q ^ ^^^^ 
1 ./3/, otherwise 

where /?/ is a known sequence. The signal received at node T is = HfXf^i + Wf. If qi-i = 0, then % = 0. That is, 
if the transmitter had concluded that it was a "good" channel after round i — then it does not attempt to re-estimate the 
feedback signal. Otherwise, the transmitter decodes the power level by a MAP detection with a threshold of SNR^^™" 
to get an estimate of qi to get gj. 

4) Round K (forward only): The transmitter then trains the receiver and then sends the data both at a power level of 
SNRi+G„(r-+e) ^j^gjg y ^ e[l,K -l]:qy = 0}\JK}- 1. 

Theorem 3. The iterative quantization protocol, described by above Steps 1-5, achieves a diversity order of d(^K-i) 51 = 
GK{r). 

Proof: The proof is provided in Appendix C. ■ 

Note that this diversity is same as that with K levels of perfect feedback when the receiver knows perfect channel state 
information as given in [8J. Thus {K — 1).5 rounds allow the transmitter and receiver to agree upon log2(iir) bits about the 
channel. Note that the bits about the channel H relate to the outage regions, and hence the Voronoi regions are concentric 
spheres, all of which are centered at origin. In Section III-D, we will provide further intuition to the above result. The following 
example serves as the stepping stone to the general discussion. 



Example 3 (2.5 round iterative quantization, 1 bit feedback per round): Consider a SISO system where the receiver now 
does not know the value of channel estimate H, but only an estimate Hi. The above strategy reduces to the receiver deciding 
if Hi gOi, where 

di = {^1 : log (1 + I^iI^SNr) <R = rlog(SNR)} . (20) 

Let a be the negative SNR exponent of p while a be the negative SNR exponent of Note that Oi represent the 

event a > 1 — r. Since a = a with probabihty 1 when S < 1, this decision from the receiver is rehable. The receiver uses 
a power level of SNR° to communicate Oi and SNR to communicate the possibiUty of 0°. If the transmitter receives OJ, 
then this was the only possibility of being transmitted (ignoring the events with probability = 0). However, if the transmitter 
receives Oi, OI could still have been transmitted and the error happened with probability SNR~^. 

In the second round, the transmitter trains at a power of SNR^-'' only if it receives Oi. If the receiver earUer sent OI, 
the channel was good. Hence, a MAP detection is done to estimate if the transmitter made a mistake. The error made b^ the 
transmitter would be perfectly known at the receiver in this case. If the receiver earUer decided that the channel was in Oi, it 
now estimates the channel H2 and decides if H2&O2, where 

O2 = {h2 ■■ log (1 + l^apSNR^-'-) <R= rlog(SNR)} . (21) 

Note again that the decision is being made if a, < p — r which will be correct as the estimate and the actual channel will 
have same exponent in this region. If the receiver sent O2 and there was no error at transmitter, or the receiver sent O2 and 
is in O2, the receiver uses a power level of 0. However, in the remaining cases, power of SNR^""" is used. If there is an error 
in feedback, higher power will be used and hence there is no outage associated with this error. 

Now, the transmitter knows if SNR or SNR^"*^ might be fine or not. Hence, uses the three power levels of SNR, SNR^"'^ 
and SNR"^"^'^ in these three cases. There is an outage when power level of SNR^"^'^ is not sufficient to avoid outage, which 
happens with probability SNR"'^'^"'^^ giving a diversity of 3(1 — r). ■ 
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D. Discussion of Iterative Quantization 

Although, the diversity of GKir) can be obtained with perfect channel state at the receiver and perfect quantized feedback 
of K levels, achieving this gain when both the channel estimates and the feedback are noisy is a challenge. Since the receiver 
has to use training symbol to estimate the channel, it cannot resolve the channel gains which are smaller than the estimation 
error. Recall that power control gain results from using large instantaneous power in poor channel conditions. Since such poor 
channel states are rare, the transmitter has to use large power rarely and hence can still meet the average power constraint. 
However, it is precisely the identification of the poor channel states which is not possible due to error in channel estimation 
at the receiver. 

Ideally, one should use a large power, say SNRP,p > 1, for the training signal, which is the sequence /? in the protocols 
described in Sections III- A, III-B and III-C. The large training power means that the error floor is reduced to SNR~^ and we 
can identify channels gains of the order of SNR~^ with high reliability. However, the use of SNR^,p > 1 in every frame will 
violate the average power constraint of SNR. 

The solution is to use large training power only when the channel gain is too small to be resolved using lesser training 
power. Thus, the probability of using large training power should be governed by the probability of channel event, and thus the 
weighted average of transmitter power will meet the average power constraint of SNR. The above on-demand use of training 
power is precisely the main idea behind iterative quantization described in Section III-C. 

The other key ingredient is power-controlled feedback, which skews the feedback errors in one direction. The receiver in each 
iteration communicates to the transmitter if the current power level will be enough to avoid outage. However some feedback 
errors are worse than others. Consider the case of binary feedback. If the channel is Good (logdet(7 -|- HH^SV^R^) > r for 
appropriate p and r) but the transmitter decodes it as a Bad channel and uses larger power than needed, no outage occurs 
during data transmission. So confusing Good channels for Bad channels is a non-issue, especially due to additional round of 
power-controlled training. However, if a Bad channel is confused as a Good channel, then the transmitter will use lower power 
than needed and hence as a result, there will be a definite outage. Thus, confusing Bad channels with Good channels is the 
main contributor of outage due to feedback errors. 

Thus the desired situation will be that Prob(Bad — > Good) is as smaU as possible while ensuring that the probabilities of the 
events {Good, Bad} as seen by transmitter is same as those seen by the receiver; this last constraint on event probabilities at 
transmitter ensures that average power consumed is SNR. All of the power-controlled feedback designs proposed in previous 
sections skew the feedback power allocation to lower Prob(Bad — > Good) at the expense of limiting the maximum power. This 
is because the probability of Good event is asymptotically much higher than the probability of Bad event and thus if higher 
power is used for the Good event, the maximum power would be limited by SNR/n(Good). The Bad event is encoded with 
a codeword of zero power by the receiver on the feedback link, which is decoded by the transmitter using an energy-based 
estimator. The probability of confusing a zero power codeword with any codeword of power greater than SNR'', (5 > 0, i.e. 
Prob(Bad Good), decays exponentially fast. In comparison, the probability of confusing a Good channel with a Bad due 
to decoding an SNR codeword as a zero power codeword decays polynomially fast. 

The iterative quantization can be more easily understood by a pictorial view of the decision process in Example 3. In 2.5 
rounds with 1-bit per feedback round, we can resolve channels into one of 3 regions depicted in Figure 5(c). In the first round 
(forward), the receiver can distinguish between the events 0\ and Oi . This is communicated to the transmitter using a power- 
controlled feedback, where qi = I representing event Oi is encoded with zero power codeword. As a result, if qi = 1, then 
qi will be 1. Now, consider the case when the actual event is Oi. In this case, in the second round (forward), the transmitter 
sends a training at power SNR^ which allows the receiver to distinguish between Oi \ Oi and Oi. Once again, if qi = 1, a 
zero-power codeword is used by the receiver, which implies that the transmitter concludes qi = 1 with exponentially small 
error probabihty. Hence, the transmitter knows gi = 1 inside Oi and qi = 1 inside Oi (with probabihty 1). Thus, we find that 
the transmitter has received an index that is the same or bigger than that transmitted by the receiver in all the scenarios. This 
indicates that there will be no error because of the transmitter receiving the index by feedback incorrectly since the transmitter 
will send at atleast the power that the receiver suggested (with probability 1). Since the feedback errors do not dominate and 
in each feedback and the receiver adds one level precision in each round, the diversity corresponding to K levels of perfect 
feedback is attained with {K — 1).5 rounds of iterative quantization protocol. 

The key to iterative quantization is that the channel is resolved to a finer level only when needed. As and when the channel 
is below the channel estimation error floor, the transmitter sends a higher power training compared to previous rounds. The 
on-demand nature of the protocol ensures neither the transmitter, not the receiver exceed their power budget and use high 
powers only in rare cases. 

IV. TDD Protocols: Exploiting Channel Symmetry 

In the FDD systems, the forward and feedback channels are independent. Thus, the feedback channel fading and errors 
had to be dealt independent of the forward channel, or in other words, information about one channel cannot be exploited 
for better encoding on the other channel. In contrast, the synmietry of TDD channels can be effectively exploited by using 
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(a) (b) (c) 

Fig. 4. Channel events (a) in the first round (forward), (b) in the first round (reverse) and (c) in the second round(forward). 

the knowledge of one channel for transmission on the other. We will show that this symmetry allows us to achieve higher 
diversity-multiplexing tradeoffs. 

The study of training based TDD systems from diversity multiplexing perspective was initiated in [4] for a MISO/SIMO 
system, where a receiver-initiated strategy was proposed and can be cast as a 1.5 round scheme. This was further extended to 
a MIMO system in [18], where it was shown that a diversity order upto 2mn (for zero multiplexing gain) was achievable. 

In this section, we will generalize the prior result and propose a multi-round TDD protocol which iteratively refines the 
channel information at both ends, and also keeps the two nodes aligned in their knowledge about the channel estimate. Unhke 
the FDD iterative protocol, this protocol uses the fact that the errors in the forward and the feedback path will be correlated since 
both paths have same eigen-values due to the reciprocity of the channel. With this protocol, the diversity-multiplexing tradeoff 
is a straight fine and dominates the diversity-multiplexing tradeoff for the FDD iterative quantization for all < r < m with 
(ii'— 1). 5 rounds for any > 2. For the special case of JsT = 2, the diversity multiplexing tradeoff of mn(mn-|-l) — (m-|-n—l)r 
can be achieved using a power controlled strategy at the transmitter based on its own channel estimate. This power control is 
similar to that in [10] where the transmitter has a channel estimate with some error variance. While the model in [10] has no 
feedback channel, we can use the similar structure due to reciprocity of forward and feedback channels. 

A. Known Results:!. 5 rounds 

In this subsection, we review the result in [5, 18] for the achievable diversity multiplexing tradeoff for a MIMO channel. 
The authors considered a receiver initiated protocol, which in the terminology of this paper can be presented in 1.5 roimds in 
which the transmitter in the first round remains silent. The protocol is described as follows, 

1) Round 1 (forward): The transmitter remains silent. 

Round 1 (reverse): The receiver sends a training at power SNR, Xf^i = \/SNR /3, where /3 is known fixed sequence. 
The signal received at T is Yf i ~ HXf i + W, which is used to form an MMSE estimate Hi. 

2) Round 2 (forward only): Let Hi = [hj ■ ■ ■ and \j = hjh^^. The transmitter trains the receiver and sends data 
using a power level of = ^^^j!^'^ ^. . 

Theorem 4 ([5]). For <r <1, the above protocol achieves a diversity multiplexing tradeoff of mn{2 — r). For 1 <r <m, 
diversity order of G{r, 1) is achievable. 

Thus, we see that the above protocol converts the MIMO system to a parallel MISO channels and uses the worst of the 
parallel channels to determine the power control. For r < 1, the power control mentioned in this protocol achieves better 
diversity as compared to the diversity achieved without without feedback. However for r > 1, the achievable diversity is same 
as that with no feedback system of [26] as feedback for each MISO channel improves the diversity only in the range r < 1. 

In the case of FDD systems, if the receiver knows perfect channel state information and there is a noisy feedback from the 
receiver, the diversity is limited by the noise in the feedback [6J. Further, if the transmitter learns the channel state through 
a quantized noiseless link and the receiver does not know the channel state information, the diversity is limited by the noise 
in the forward channel and the quantization [6]. Even if the feedback link is perfect and has infinite capacity, the training 
cannot be sent on the feedback channel since the training will indicate about the quality of the feedback channel rather than 
the forward channel. Hence the best estimate that the transmitter can receive is the noisy estimate from the receiver. On the 
contrary, in TDD systems, infinite diversity can be achieved if the receiver or the transmitter knows the channel state perfectly. 
This is shown in Appendix D. 

B. Iterative Protocol with (K-l).5 Rounds 

In this Section, we will provide an iterative protocol for a TDD system considering the noise in the training and the feedback 
channels. This protocol achieves better diversity multiplexing tradeoff than the known results even for the special case of = 2. 
We will now prove a Lemma that wiU be used to analyze our proposed iterative protocol. 
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Lemma 1. Suppose that the transmitter is trained with a power o/SNR^/or p > 1, so that the transmitter estimates the 
channel Hi with the eigenvalues of HiH\ being = SNR^"'. Further, the transmit power for the training signal from the 
transmitter is P{Hi) = ]-[">jvsi^^[^-^(2i-i+|n-mi)a^ • Then, the probability that the channel cannot support the rate of R on this 
power controlled channel Is asymptotically equivalent to 

n{\ogdei{I + P{Hi)HH^) <R) = snR-'""p-^('''1+('""-i)p) = snr-™"(i+p™")+('"+"-i)'-. 

Proof: This case results in transmitter knowing noisy CSIT, and hence the result is similar in nature to that in [10]. 
Although the result is similar in nature, we provide the proof in Appendix E for completion. ■ 

Remark 4. We note from the proof of Lemma 1 that the outage event (event that logdct(/ + P{Hi)HW) < R) has 
exponentially small probability given am < P- So, the outage happens when all the > p, or in other words when all the 
eigen-values of HH^ are in the bad state. Further, in this bad state the channel estimation does not work well in the sense 
that given that S; > p, all one can state about ai is that ai > p with probability one. For example, if cti > lOOp, all one can 
rehably say about ai is that it is > p. In SISO case, the above property of a^ implies that the channel cannot be resolved below 
the noise floor since the noise dominates the training signal. The interesting implication in MIMO is that this result of noise 
dominance holds for all the eigen-values. None of the eigen-value of the channel can be resolved beyond ai > p if on > p. 
Raising the power exponent p for the feedback reduces the probability of the bad state («„ > p) thus increasing diversity. 

We now describe the iterative, multi-round TDD protocol. The basic idea of the protocol hes in Lemma 1 . If a higher power 
is allowed on the feedback or training or both, higher diversity can be obtained. However, if higher power is used for all 
transmissions, then the nodes will violate their power constraint. Hence, we iteratively refine the information about the channel 
and use higher power for the feedback on-demand. Since the large powers are used rarely, both the nodes stay within their 
prescribed long-term power constraints. 

Let Wi{r) = 0, W2{r) = mn + G{r, mn), Wk{r) = mn{l + Wk-i{r)) - e for A; > 2. 

1) Round l(forward): The transmitter remains silent. 

Rowid l(reverse): The receiver sends the training signal using power SNR, Xf^i = \/SNR /3, where /3 is a known 
fixed sequence. The signal received at T is Yj ^ = HjXf i + W, which is used to form an MMSE estimate Hi. Let the 
eigenvalues of HiH\ be Ai,j = SNR""^ '. Further, the transmitter will save a quantization index of the channel which 
is an estimate based on the received power and is labeled at the end of round w — 1. To state the recursion, 

we assume qi = \. 

2) Round M e {2,K — 1} (forward): The transmitter trains the receiver with = P„_i(il„_i)/? where the power level 
P{Hu-i) is chosen as follows 

- fO, if qu-i = 

^--(^--^ = U^3,,-..^^.-^,.„-,. Otherwise. ^''^ 

Note that for power constraint, SNR in the numerator can be replaced by SNR^"''^"*" for 5 very small and that will not 
affect the analysis as was seen in the proof of Lemma 1. 

If w = 2, the receiver estimates the power controlled channel G2 = \J Pi{Hi)H and decides an index 52(^2) as 

= ''^^^"^ (23, 

II, Otherwise, 

where O2 = {G2 : dct(/ + G2GI) < SNR'^+'=}. 
For u > 2, the receiver's actions can be described as follows. 
. If = 0, 9„(g"-SG„) =0. 

• If qu-i = 1, a MAP power estimation is done by the receiver. If the power <SNR'/^'"", g„ = 1. Otherwise, the 
estimates the power controlled channel G„ = sJp{H^_i)H, and 

,.(,-',G„) = |°- (24, 
II, Otherwise, 

where 0„ = {G„ : det(/ + G„Gt ) < SNR'^+^j. 
Round Me {2,K — 1} (reverse): The receiver sends the signal 

_f0.,,3/. g„ = 

WSNRi+^"('-+^)/J/, g„ = l' ^^^^ 



receiver 
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where /?/ is a known sequence. The signal received at node T is = HfXf^u + Wf. 

The transmitter estimates the index sent by the receiver using estimate of the received power with a threshold of SNR*^/™". 
If the received power >SNR^^™", then = 1 else g„ = 0. If = 1, the transmitter estimates the channel as Hu-i- 
Let the eigenvalues of Hu-iH\_-y be \u-i,i = SNR""""^'*. 
3) Round K (forward only): The transmitter trains the receiver and then sends the data both at a power level of 



Pk{Hi,- ■ ■ , Hk-i) = SNR™'''^'^"'"^^! '"'"^^*~"^"'"'"~"*'^"^'*'™'*^"Sl^'^-il^««=^ l+EJ^i (2i-l + |n-m|)a«,i) ^26) 

Theorem 5. If the iterative TDD protocol is used for K >2, following diversity -multiplexing tradeoff can be achieved 



d(K-l).5 



mn '^'Z'l ,^ - (mn)^-'^(m + n-l)r mn > 1 
K — r mn = 1 



Proof: We will show in Appendix F that diversity of mn(l + WK-i{r + e)) can be obtained, which for e — > converges 
to the statement of the Theorem. ■ 

Remark 5. As a special case of the theorem, for < r < min(m, n), diversity of mn+G{r, mn) = mn{mn+l) — {m+n—l)r 
can be obtained in the TDD system with the 1.5 rounds of training and feedback. In FDD system, the transmitter learns constant 
number of bits about the channel estimate of the forward channel (channel from the transmitter to the receiver) in the first 
round. However in the TDD case, the transmitter knows the forward channel estimate (due to reciprocity in the two channels) 
with larger resolution. Hence, even though the transmitter remains silent in the first round, it receives more information in the 
first round. From the point of view of the receiver, only the power-controlled channel estimate in the second round is used for 
decoding the data since it has atleast the same information about the channel (asymptotically) as the first round of training 
and hence the first training is neglected for decoding. Hence, the TDD protocol performs better than its FDD counterpart. 
However, the TDD strategy results in the same diversity multiplexing tradeoff performance for a SIMO/MISO system as the 
FDD strategy when the number of feedback levels K ^ oo. 

In the iterative TDD protocol, we used a training symbol from the transmitter in each round. However, we could have used 
one bit of data from the transmitter in each round (since the protocol is receiver initiated, this is like the quantized feedback 
from the transmitter.). One bit of quantized feedback can be used to achieve a diversity of G/i-(r). Although G/f (r) is smaller 
than what we achieve with this iterative TDD protocol, it takes less time to send one bit of data (one channel use) than to 
train (m channel uses). Thus when accounting for the resources used in training and feedback, an optimization over the use 
of a quantized feedback symbol or the training symbol in each round of conununication would also be required. 



C. Discussion of TDD Protocols 

Like in the iterative protocol for FDD systems, we extend the TDD protocol to an iterative protocol where higher powers 
are used rarely leading to arbitrarily large gains in diversity with increase in communication rounds. The TDD protocols use 
reciprocity in addition to the asymmetric properties of the power control. Due to the channel symmetry of the forward and the 
backward channels, the receiver will be able to find whether the transmitter made an error in understanding the power level. 
Thus, the channel symmetry can be used to increase the diversity order beyond that is achievable using FDD protocols. 

Channel reciprocity enables the receiver to detect if the transmitter has made an error in understanding the feedback signal 
sent by the receiver. Thus, if the receiver indicated that the channel was Bad in the previous feedback phase but the transmitter 
understood it as Good, then transmitter's actions will allow the receiver to detect this error with high probabihty. In this case, 
the receiver can re-send its feedback signal with higher power. 

To understand the specific operation of the protocol, we will use the pictorial view depicted in Figure 5 for a SISO system 
with 3.5 rounds of TDD protocol. After the forward phase of second round, the receiver knows if the channel is Good (outside 
the circle in Figure 5(a)) or Bad (inside the circle), and indicates its first feedback as 92- Since the feedback channel is same 
as the forward channel, the transmitter can make an error in understanding the feedback about only the Bad channel ((72 = 1). 
That is because when the forward channel is Good, so is the reverse channel. Thus, the feedback in this case is received with 
exponentially small error probability. On the other hand, when the forward channel is Bad, indicated by q2 = 0, so is the 
feedback channel. In this case, the transmitter will make errors ($2 = when q2 = 1) if the actual channel was inside the 
smaller circle of Figure 5(b), while not make errors in the shell between the two circles since the channel is not that bad 
(§2 = 1 when q2 = 1). In the inner most circle, since 92 = 1, the receiver knows (with exponentially small error probability) 
that the channel is Bad but the transmitter send no more training assuming it is a Good channel since it thinks that no more 
channel resolution is required. The receiver can detect the erroneous training power and realize that the transmitter made an 
error. Detecting an error, the receiver will re-send the same information as 53 = 1 but with higher power to ensure safer 
delivery. For clarity, we step through each phase of the protocol. 

In the first round, the receiver sends a training to the transmitter based on which the transmitter finds the estimated channel. 
Using this estimated channel, the transmitter trains the receiver in the forward phase of second round. With this training, the 
receiver finds the regions where this power level will be sufficient to avoid outage. It sends q2 = when the channel is Good 
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(a) (b) (c) 

Fig. 5. Channel events (a) in the second round (forward), (b) in the second round (reverse) and (c) in the third round (forward). 

(the event that the channel is not in outage based on the estimate of power-controlled channel) while q2 = I when it is Bad 
(not Good). Thus, the channel space is divided in two parts at the receiver. This is depicted in Figure 5(a). The receiver sends 
this index to the transmitter and due to the asymmetry in the feedback errors, Good channel state is received as Good state 
(§2 = 0) with probabihty 1. However, the Bad channel state iq2 = 1) may be mistakenly understood as Good channel state 
(§2 = 0) by the transmitter. Hence, as shown in Figure 5(b), the transmitter receives the index in error inside the inner circle. 

In the third round, the transmitter sends a training symbol if it understood that the channel is in Bad state (92 = 1) which 
is in the shell between the two circles. Note that no training is sent in the iimer circle where the transmitter received incorrect 
feedback. Since outside the outer circle, the receiver has resolved the channel as being Good, it continues to inform Good 
channel state (53 = 0). In the interior of the inner circle where the transmitter was mistaken, the receiver will be able to know 
that the transmitter made an error (This is because the no training is sent and the power-controlled forward channel wiU be in 
outage, i.e. log(l + GG^) < SNR''"'"^ where G is the power-controlled channel estimate, with probabihty 1). This is due to the 
symmetry which allowed receiver to know that the transmitter has made an error and the channel is Bad. Hence, the receiver 
informs the transmitter that the channel is Bad (q^ = 1) in this region. However, when the receiver sent Bad channel in the 
previous round which is correctly decoded as Bad at the transmitter (52 = 92 = 1), the receiver will make an estimate and 
decide if this power level is enough to avoid outage. Based on this, it classifies the region into Good (^3 = 0) or Bad ((73 = 1) 
as can be seen in Figure 5(c). Outside the outermost circle, the transmitter receives Good ($3 = 0). In the shell between the 
two circles also, the transmitter receives Good ($3 = 0) due to the asymmetry in the feedback errors. However, the inside 
circle would be divided in two parts since the transmitter may mistake the Bad channel state (^3 = 1) as Good channel state. 
The inner most would be received as Good (gs = 0) while the outer part will be received as Bad ($3 = 1). Thus, we see 
that the region corresponding to the uncertain part at the transmitter is always a circle centered at the origin and whose radius 
keeps on shrinking with more rounds of the iterative protocol. 

There are two main outage events in this protocol. The first outage event is the channel state being in the inner most circle 
where the transmitter received Good ($3 = 0) since lower power would be used for transmission than is needed to avoid 
outage. The second outage event is the area that the receiver classified as Bad (53 = 1) and the transmitter correctly decoded 
it as Bad {qs = 1) and the higher power level used by the transmitter is insufficient to avoid outage. Since both these outage 
events can be represented as one inner-circle which shrinks with increasing rounds, the resulting diversity order increases in 
accordance. 

Note that both FDD and TDD protocols have the element of using information collected in previous rounds to form the next 
feedback signal. This memory in encoding channel state information is critical to adaptive zooming into the actual channel 
state. As apparent from the above discussion, the protocol zooms into the poor channel states on-demand, and at the same 
time tries to keep both the transmitter and receiver ahgned in their information about the channel. 

V. Conclusions 

In this paper, we derived the diversity tradeoff for a FDD system and a TDD system in which the errors in channel estimation 
and the feedback channel are fully accounted. This paper finds the diversity multiplexing tradeoffs when the receiver and the 
transmitter are allowed to exchange multiple rounds of messages. The diversity multiplexing tradeoff increases with each round 
of message passing between the nodes increases the achievable diversity multiplexing. This paper gives a way of resolving 
errors with more rounds of message passing in a system where the errors in training and feedback are considered. 

The two models, FDD and TDD, considered in this paper are the two extreme cases of the correlations between the forward 
and the backward channel. As a next step, one can consider what happens if the forward and the feedback channel are correlated, 
but not identical as in our TDD protocol. When the channels have any constant correlation, the results of FDD system will 
apply since any constant correlation will make the two channels look independent in high SNR regime. Further analysis to 
consider the case when the correlation between the forward and the backward channel p that satisfies 1 — = SNR~" for 
any u > would be a transition from the FDD approaches to the TDD approaches on increasing u, and is still open. 
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Also, this paper assumes a Rayleigh fading channel model. The authors of [27] consider a general model for fading which 
includes Rayleigh, Rician, Nakagami and WeibuU distributions to find the diversity multiplexing tradeoff for a system with no 
feedback and perfect channel estimate at the receiver. The extension of the feedback cases to general fading models is still 
open. 
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Appendix A 
Technical Preliminaries 

1} Distributions of eigen-value exponents with training: Suppose that the actual channel is H. Let that eigenvalues of HH^^ 
be (Ai, • • • , Atom), where rriN = min(m, n). Without loss of generality, we will use tun = Tn throughout this paper. For more 
generality, m can be replaced by min(m, n) and n by max(n, m). Further, let A^ = SNR~"* and a = (ai, • • • , amiv)- Th^ 
distribution of aj's is given as follows. 

Lemma 2. [25] Assume ai > a2 > ■ ■ -aniM power exponents as described above. In the limit of high SNR, the 

probability density function of the SNR exponents, a, of the eigenvalues of HH^ is given by 

p{oc) = n^«SNR-(2'-i+l"-"l)"'l^i,(„)>o. (27) 

Now suppose that the actual channel H is estimated via a training symbol at a power level of SNR^. The MMSE channel 
estimate is denoted by H. Let the eigenvalues of HW be (Ai, • • • , A^n)' \ = SNR^"' and a = (Si, • • • , 

Let ai> a2> ■ ■ ■ ^i^d Si > S2 > • • • Sm^ (Note that throughout the paper, this ordering will be assimied without 
loss of generality). Further, we define 

Ek = {{a, a) : min(Q;i, S,) > p Vi = 1, • • • , fc, and < a, = S^ < p V i > fc} (28) 

for aU < fc < mjv- 

Lemma 3. Let H be the channel and H be the estimated channel using training power o/SNR*'. In the limit of high SNR, 
the probability density function of the SNR exponents of the eigenvalues of HW and HW is given by 

p{a,a) = ^CfclB, (29) 

k=0 

where ai> a2> ■ ■ ■ ccmj^, Si > S2 > • • • S^j^, and 

_ 5|yjpfcp(|n-m|+fe)jjfe_^5|\|p-(2i-l+|n-m|)aijjmiv5|y|p-(2i-l+|Ti-m|)ai^ 

Proof: This lenmia was proved in [6] for p = 1 and is straightforward to generalize for any p> 0. ■ 
Corollary 3. For the case ofm = n = 1, p{a,a) = SNR~"lo<a=a<p + SNR^~'*~"li„in(a,a)>p- 

Note from above that if < p,a.i^ ai with = probability. Thus, the eigen-value exponent can be assumed to be correct 
based on the training if it is < p. However if it is > p, then the value of ai and Sj have no relation except that Sj > p. Thus, 
only limited information can be extracted. This is because the estimation noise becomes dominant. 

A novel scheme called power-controlled training was suggested in [4] which takes care of the estimation error in training 
by estimating the power-controlled channel. If the actual channel is H and the transmit power is SNR^^. Suppose that the 
receiver do not know pj. The receiver makes an estimate of H. With this estimate, the estimation error given by 

trace will be at-most of the order of constant. This is because the channel estimation error in H, H have 

complex normal entries with mean and variance SNR^^. Thus, let eigen-values of SV\R^^ HW be A^ = SNR^"' where on 
have = probability if < as given before. Then, trace (SNR^^ if fft) = SNR"""<SNR°, Thus, the estimation error is at 
the noise floor. For more details, the reader is refered to [4,6]. This scheme will be used in all the protocols in this paper. 

2) Power Controlled Feedback: We will use the concept of power-controlled feedback [6] throughout the paper. For 
illustration, we will focus on a case of distinguishing between two feedback levels. Consider feedback levels a and b are 
transmitted from the receiver using power levels of SNR^^ and SNR^^ respectively. Without loss of generality, assume that 
Pi <P2- The transmitter will observe the received power and find if the receiver sent a or 6. As in [6], if the receiver transmitted 
a symbol at SNR''', the received power is = SNR*^^*""™-* where is the smallest eigen-value exponent of the channel as 
was defined in the last subsection. We will use the threshold detection at the transmitter. In [6], threshold of SNR^^^'^ was 
proven to be optimal for arbitrarily small 5 > 0. To see this, we consider the error events at the transmitter: 

1) n(Transmitter received a|Receiver transmitted b) = U{p2 -am<Pi + S) = SNR-'""^^^-^!-*^ 
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2) n (Transmitter received 6|Receiver transmitted a) = n(pi — am > pi + S) = 0. 
Since the second event happens with probability = 0, ^ is chosen arbitrarily close to to get the optimal detection at the 
transmitter. 

Appendix B 
Proof of Theorem 2 

We will first verify that the average power constraint is satisfied. 

1) Round 1: For the feedback, cm < 1 and thus (cm — G„(r + e))"*" < 1. Further, 

n ((7i(Hi) = K-lj = n (det(7 + ^i^|SNR^+f^-=) < SNR''+') = SNR-°(''+''^+f^-=\ Thus, the power con- 
straint in feedback is satisfied. 

2) Round 2: To observe the satisfiability of the power constraint, we will find the probabilities of gi(-ffi) and 5i- 
We note that the probability of qi{Hi) is bounded as: 

n (gi(^i) = u) < SNR-™('^»(''+')''="' = SNR-f" ifu<K-l (31) 

n (gi(^i) = K-1^ < SNR-'^(''+^'^+f^-=' (32) 

The first term is because if G„_i(r + e) < Cm then 11 (qi{Hi) = uj = SNR'^"(''+''\ but if G„_i(r + e) > Cm then 
n (qiiHi) = - 0. 

Since the received power cannot have a larger SNR exponent than the transmitted power asymptotically, 

n (gi = if - 1) = n (gi(^i) = if - l) < SNR-^(''+''^+f^-=\ (33) 

For < u < if - 1, 

n ($1 = w) = n (51 = u, qi (ill) > u) + n (gi = u, qi {Hi) < u) (34) 
< SNR-f" + n ($1 = u, qi{Hi) < uj (35) 

The last step follows since 11 (qi{Hi) =U2>uj <SNR"*'''. Thus, 

n {qi =u) < SNR-P" +Il(qi = u, qi{Hi) < (36) 

u-l 

< SNR-f« + ^n(§i = w|9i(ffi) = i)n(gi(ffi) = i) (37) 



i=0 
u-l 



< SNR-^'" + ^n($i = u|5i(ffi)=i)SNR-f* (38) 

i=0 
u-l 

< SNR"^" + SNR~'""(^'^'^~^'^''"'"'^^^^~^°"~'^"^''"'"^^^^~'^/'"")SNR~^' (39) 

u-l 

< SNR~^" + ^ ] g|\jp-mn(min(cM,G„(r+e))-Gi(r+e)) + +(5-min(eM,Gi(r+e)) {40) 



i=0 

u-l 



< SNR-P-' + ^SNR-^P"-'^) (41) 



=0 



< SNR-(f«-*'"^ (42) 

Hence, the power constraint is satisfied. We will now use ^ w to compute the outage probability of different events. 

1) dct(/ + iiii^ SNR) > SNR'^. In this case, any power level is sufficient, and hence there is no outage. 

2) For 1 < u < if - 1, dct (^/ + i/i7tSNRi+'"i"('=M'G„_i(r+.))^ ^ ^^^r 

det (j + iJiJtSNRi+™"('="'^" > SNR^ We wiU first show that in this case qi{Hi) > u. Let the eigenvalues 
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of and HiHt be SNR""' and SNR""' respectively. Note that 

n (det(7 + fl"ii"tSNR^+™("*^'°''-i(''+^») < sm\ 

det(/ + ij^^ts|S|Rl+min(c„,G„_i(r+e))^ > SNR''+^) (43) 

(mm \ 
^(1 + Pu-1 - ai)+ < r, ^(1 + pu-i - ai)+ >r + ej. (44) 

If r < m — 1, = and thus ^(l +Pu-i — cci)^ = +Pu-i — cf^)^ making the above probability = 0. 

For r > m — 1, first consider that atleast A; > 1 a^'s are > 1 (or, ai, • • • , afc > 1). In this case, 

m 

^(l+p„_i-ai)+ < kpu-i + {m-k){l+pu-i) (45) 



i=l 



< kcM + {m - k){l + Cm) (46) 

< mcM + m — k (47) 

< r-(m-l) + m- l (48) 

< r. (49) 

Thus, + Pu-i — > r + e can never happen. Thus, only possibiUty is that all aj < 1 in which case 

X™,(1 + pu-i — ai)"*" = jy^ii^ + Pu-1 ~ oii)~^- Since gi(iJi) > u, qi > w, and thus there is no outage in this case. 

3) det (/ + i7iltSNRi+™"("*^^^^-^(''+^») < SNR'' and det (/ + 77i?tSNRi+"""(<=^-i('^+<^)'<=('''i+""») > SNR''. As 

before, gi(-ffi) K — 1 with probabihty = 0. However, qi can now take a lower value. Thus, the outage happens 
whenever any less power is received at the transmitter. This happens when the power of SNR^"'"'^^''"'"^'^"'"^^"^^ is received 
below SNR""+'^/'"'' which happens with probability SNR-"'"(i+^(''+<^'1+p^-^)-"*^)+'^. 

4) det (/ + iJijtSNRi+'"'"(^^-i(''+<^)'^('''i+'^"») < SNR''. This happens with probability 

5|\|p-G(r,l+min(GK-l(r+e),G(r,l+CM))) 

Thus, the overall outage probability is bounded by 

11(0) < 5|\|p-'""(l+G(r+e,l+pft:_2)-CM)+i5 _j_ 5|\jp-G(r,l+min(Gsr_i(r+e),G(r,l+CM))) (50) 
= S|\|R-"»"(l+G(r+e,l+pA:_2)-CM)+i5 _|_ gjyjp- inin(G(r,l+GK_i(r+e)),G(r,l+G(r,l+CM))) (51) 

For r < m— 1, pk-2 = cm = and thus, mn{l+G{r+e, 1+Pk-2)—cm)+S = rrm{l + G{r+e, l))+6 > G{r, l + G{r+e, 1)). 
Thus, choosing e and 6 close to 0, diversity of inin(Gi<-(r), G(r, 1 + G(r, 1 + cm))) can be obtained. 

For r > m — 1, Let d = G{r + e, 1 + pk-2)- We will show that mn{l + d — Cm) > G{r, 1 + d). 'With this, the diversity 
will reduce to min(GK(r), G(r, 1 + G(r, 1 + cm)))- 

To see mn{ 1 + c? — cm) > G{r, 1 + d), note that for m = 1, left and right sides are both n{l + d — r) and thus the inequality 
is satisfied. For m > 1, 

G{r,l + d) < G(l,l + (i) (52) 
= mn(l + d)- {m + n-1) (53) 

< mn(l + d)-n (54) 

< mn(l + d) — mncM (55) 



Appendix C 
Proof of Theorem 3 

"We will first verify that the power levels used in the protocol satisfy the average power constraints. 

1) Round 1: Transmitter trains receiver with power SNR, and hence the average power constraint is satisfied. The receiver 
uses a maximal power level of SNR and hence the average power constraint is satisfied. 

2) Round 2<u+l<K-l: 

n($i = 1) = n($i = i,qi{Hi) = 0) + n(jTi = i,gi(^i) = i) 

< n(?i = i|(zi(^i) = 0) + n(gi(^i) = 1) 

< SNR-'"''+' + SNR-°^(''+'^ 

= SNR-^i(''+^), (56) 
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where we used Il{qi ^ l\qi{Hi) = 0) = SNR"""+' since the threshold is at snr-^/"" and by [6]. Further, U{qi{Hi) = 
1) = det(/ + HiHlsm) < SNR'^+" = SNR-^^(''+'^ The last step follows since Gi(r + e) < Gi(e) =mn- e(m + 
n — 1) < mn — e. 

Hence, the power constraint is satisfied at the transmitter for m = 1. 



n(g„+i = 0) = n(g„+i = o,q^ = o) + n(g„+i = o, g„ = i) 

= n(g„+i = 0, (7„ = 0, qu = 0) + n(g„+i =0,qu = 0, g« = 1) 

+n(q„+i = 0,qu = = 1) 
= ^{Qu+i = 0, g„ = 0, g„ = 1) + U{qu+i = 0, g„ = 1, g„ = 1) 

< n(g„ = l) 

< SNR-^"('^+') (57) 

where we used three things: 

a) n(q'„+i = 0,g„ = = 0) = 0. If u = 1, n(g„ = = 0) = 0. For this consider two cases. If = for 
some i < u, then as g„ = 0, = 1 since the power estimate > SNR*^ happens with probability = 0. Further, if 
qi = I for all i < u, g„ = happens with probability = since any transmitted 1 from the receiver gets received 
as with exponentially small probability. 

b) n((7u+i = 0,qu ~ 0, qu — 0) = 0. This is because when transmitter sends at power, the receiver receives at 
power > SNR^ with exponentially small probability. 

c) n(§"„ = 1)<SNR~'^"^'^''''^^ which has been earUer shown to be true for u = 1, and we will show in general by 
induction. 

Hence, the average power constraint at the receiver is satisfied for all u. To check the induction argument for g„ and 
transmit power constraint, 

n(g„+i = 1) = n(g„+i = 1, = 0) + n(g„+i = l, q^+i = l). (58) 

For the first term n(g„+i = 1, g„+i = 0). This happens when trace(//'i/'t)SNR^+'^"(''+') + 1<SNR'/™" which happens 
with probabihty SNR-'""(l+^"(''+^»+^ 

The second term can occur only when all the qi to qu+i = 1, and all to qu+i = 1. The reason for all qi to qu+i = 1 is 
because if any of = all gi+i = qu+i = since the forward channel was good and cannot be bad in the next roimds. 
This happens when det(/ + #„+i^i+iSNRi+'^"('^+'^)) < SI\IR''+', which happens with probability SNR~^"+i(''+'\ 
Note that 

G„+i(r + e) = G(r + e,l + G„(r + e)) 

< G(e,l + G„(r + e)) 

= mn{l + Gu{r + e)) — {m + n — l)e 

< mn{l + Gu{r + e)) - €. (59) 

Hence, 

n(§-„+i = l) < SNR-^"+i('-+^). (60) 

Hence, the induction steps hold, and the power constraint is satisfied. 
3) Round K (forward only): 

n(gi = 1,52 = 1) = n($-i = 1, $2 = 1, 92 = 0)+n(gi = l, $2 = 1,91=0,92 = 1) 
+ n($i ^ 1,92 = l,9i = 1,92 = 1) 
< n(9i = 1,92 = 1,92 = 0,91=0) 
+n(9i = 1, 92 = 1, 92 = 0, 91 = 1) 

+n(9i = 0, 91 = 1, 92 = 1) + n(92 = l|9i = 1, 91 = l). (61) 

The first term 91 = 1, 92 = 1, 92 = 0, 91 = happens asymptotically with probability that there is error in the reverse 
channel both times. This happens with probability SNR"'""^^+'^^ 

The second term qi = l,q2 = l,q2 = 0,qi = 1 happens when 91 = 1 and when there is error in the feedback channel 
in the second round. This happens with probability less than SNR"""^^+'^l(''+')'+^ 

The third term qi = 0,91 = 1,92 = 1 happens with exponentially small probability since the probability det(/ + 
HiHlsm) > Sm''+' and trace(il2^f2^SNR^+'^i(''+'') + 1 < Sm' is = 0. Note that term of Hi and H2 are 
correlated with correlation pij s.t. 1 — pf j = SNR~^. Let the eigenvalues of H\H\ and H2H2 be Ai, • • • , Ajnin(m,n) 
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and Hi, - ■■ jMmin(m,n) respectively. Let Aj = SNR"* and /Xj = SNR 

n (det{I + HiHlsm) > SNR'■+^trace(i^2i^|SNR^+^l(''+')) + 1 < SNR') 
= n(^^(l -a,)+ > r + e, (l + Gi(r + e) -minai)+ < 
= n(^(l-Si)+>r + e,(l + Gi(r + e)-minai)+ < 

= (62) 
The fourth term in right hand side of (61), 

n(g2 = l|ai = l,?i = l) = SNR-«=('^+^). (63) 

Hence, (61) reduces to 

U{qi = 1, §2 = 1) = SNR-""(i+^i('^+^»+^ + SNR-^^('^+^) 

= SNR-^^('^+<^). (64) 

Similarly, we see that for w > 2, if 5i = • • • = $u = 1- Then, if any of g„ = z for some i < u, then Qu = 0. 
Also, the feedback channel transforms g„ = to = 1, which happens with probabihty SNR~'""^^+^"-'^''+^^^+^ 
However, if = 1 for all i < u, this happens when logdet(/ + i?„Fl;SNR^+'^"-^^''+'^) > SNR''+' which happens 
with probability SNR"^"(''+'\ Hence, the event §i = • • • = g„ = 1 happens with probabihty = SNR"^"(''+'\ Hence, 
the power constraint is satisfied. 
We will now show that the outage probability is = SNR"'^^'''^"'"'^^-^^'""'"*^^ For this, we see the region of H when there 
will be outage. 

1) det(/ + ifJJ^SNR) > SNR*^. In this case, power of SNR is sufficient, while we will be using a higher power level. 
Hence, there can be no outage. 

2) dct(/ + ffiJtSNR) < SNR'' and dct(/ + HH^Sm^+^^^''+'''^) > SNR''. In this case, qi = 1 with probability 1. 
The reason is that 11 _ ai)+ < — > r + ej =0 (where a-i and ai are the negative 

exponents of the eigenvalues of HH^ and HiH\). Since gi = 1, 5i = 1 with probability 1. Since 5i = 1> power of 
atleast SNR^"''"^ is used and hence no outage in this range. 

3) For ue [2,K - 1], dct(/ + iJHtSNRi+^"-i(''+<^)) < SNR'' and det{I + HWSm^+^^^''+''^) > SNR''. As before, 
qr = q^ = 1. Now, since training is at SNR1+^i(''+^\ H + Gi(r + e) - < r, 



X^i^i*' (1 + Gi{r + e) — Si)^ >'r + e^ =0 (where Ui and Si are the negative exponents of the eigenvalues of 

HH^ and H2HI). Hence, 92 = 1 which implies §2 = 1 with probability 1. Similarly extending all till = = 1 with 

probability 1. Hence there is no outage in this range 
Hence, the outage happens only when dot (/+i7i7tSNR^+<^" (''+')) < SNR'' whichhappens with probabihty SNR-'^('''^+°^-i(''+')\ 
For e very small, diversity of Gif (r) can be achieved. 



Appendix D 
TDD: Perfect CSIR or Perfect CSIT 

In this Appendix, we consider two cases. The first case assumes perfect channel state information at the transmitter, where 
only receiver is unaware of channel conditions. Second, we consider the case of perfect information at the receiver, with no 
channel information at the transmitter. In both the above cases, infinite diversity order can be achieved with 1.5 round protocols 
which is unlike the case of FDD systems[6]. 

For the case of perfect channel knowledge at the transmitter and receiver, it is well known [1] for the case of min (m, n) > 1, 
zero outage probability is possible at a finite SNR value and thus the diversity order is unbounded. For the case of single 
antenna system, i.e min (to, n) = 1, channel inversion leads to an exponential decay in probability and hence also has an 
infinite diversity order. We show in the next two results that infinite diversity order can be achieved even if only one of the 
nodes, transmitter or receiver has perfect channel information. 

We begin the discussion with the case when the channel information is perfect at the transmitter and receiver relies on 
estimated channel knowledge. 

Lemma 4 (No CSIR, Perfect CSIT). For all < r < min (m, n), an infinite diversity order is achievable with a 0.5 round 
protocol. 

Proof: The protocol proceeds as follows. 
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Round 1 (forward only): The transmitter trains the receiver with a power level 



/ mn 



P{H) = 



-(2i-l+|n-m|)ai 



where A; = SNR~"' are the eigen-values of the channel realization H at the transmitter, and sends data at the same power. It is 
straight-forward to conclude that the power constraint will be satisfied, and the outage probability = for r < min (m, n) — e. 
(By choosing e small enough, the above holds for r < min (m, n).) Note that the power control from the transmitter is only 
asymptotically satisfied. ■ 
Since the transmitter knows the channel perfectly, it can "invert" the channel exactly for the case of min {m, n) > 1 and 
in this case, the receiver sees a channel whose SNR does not fluctuate and is always above the threshold needed to avoid the 
outage. Similarly, for the case of no information at the transmitter but full information at the receiver, we can achieve an error 
probability decay which is faster than any polynomial decay in SNR. 

Lemma 5 (Perfect CSIR, No CSIT). For < r < min (m, n), an infinite diversity order is achievable with a 1.5 rounds 
protocol. 

Proof: The protocol proceeds as follows. 

1) Round 1 (forward): The transmitter remains silent. 

Round 1 (reverse): Since the receiver knows the channel H, it finds the index of the power level from i <£ [^tK] that 
would be sufficient to avoid outage as in [8]. Let Aj = SNR~"* are the eigen-values of H with ai > a2 > • • • > am- 
Further, the receiver communicates this power level to the transmitter using a power level of SNR"'"''" to communicate 
index i. The transmitter estimates the received power and hence gets the index i with error probability = 0. 

2) Round 2 (forward only): Since the transmitter now knows the power level to use, it sends data at the required power 
level. 

First note that the probability with which the power level of SNR"™"*"^ is used is <SNR~"'" and hence the power 
constraint is satisfied. The only step to prove is to show that the probability with which the transmitter will make a mistake to 
decode the index transmitted by the receiver is = 0. Suppose that the transmitter uses a threshold for level i at SNR "^^^ 
Since the transmitted power is SNR""'"'"^^, the received power is = SNR^^+^SNR""" = SNR^. Here, we exploited the 
reciprocity in the channels since am for the forward and the backward channels are the same. Since there is nothing random 
in the above exponent of SNR in this received power (asymptotically), using the above threshold, the index be mistaken with 
probability = 0. 

The above strategy attains the diversity of Gk {r) for any finite K. Since Gk {r) monotonically increases with K for 
< r < min (m,n), for any given a; > diversity > x can be achieved by choosing K large enough. Thus, the above 
protocol yields unbounded diversity. ■ 

Appendix E 
Proof of Lemma 1 



The probabiUty that the channel cannot support rate R is given by 
n (logdet(/ + P{Hi)HH^) < R^j 



' inin(m,n) min(m,n) 

= n| E (1-^^+ E (2»-l+|n-m|)Si-ai)+<r 

i=l i=l 



p(a, a)dada 

'A 

min(m,n) 

= ^ / SNR*^(l"-'"l+'=)n,iiSNR-(2'-^+l"-'"l)"*n^^SNR-(2'-^+l"-'"l)"*dada, (65) 

where ^ = {a, S : (l - + (2z - 1 + |n - m|)S, - a,) ^ < r 

We first note that AO = 'S> for k < min(m,n). Let m < n, other case is similar. For any Eh where k < min(m, n), 

< ai — cii < pVi > k and min(Q;i, ai) > pii = 1, • • • , fc. Thus, 
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min(m.n) / min(m,n) ^ ~^ 

e 

\n — m| ]ai — on 

1=1 \ «=i 



i=\ \ «=i 

fc / min(7n,n) 

= E — + E (2i-l+ |n-m|)ai-ai 
»=i \ »=i 

m / k n 

+ V I 1 - — + V(2i - 1 + In - m\)ai + V (2i - 1 + In - m|)ai - a, 

i=k+l \ J=l i=fe+l 

m / k m \ 

> E ( 1 ^ ~ + E(2* - 1 + |n - m|)ai + ^ (2i - 1 + |n - m\)ai - ai j 

i=fc+l \ i=l i=fe+l / 

TO / k \ + 

> E i-;i: + E(2^-i + l"-H)sJ 



+ 



+ 



mn 

i=fe+i \ i=i 

m / k 

e 

ml 

mn 

i=fe+l \ i=l 



+ 



> y 1 — +y(2i-i+|n-' 

i=fe+l \ i=l / 

= (m-k)i\ ^ + A;(fc+ |n-m|)) 

V mn / 

> (m - fc) f 1 + - — ) (66) 

V mn/ 

For fc = 0, above is > m(l — e/mn) and for any r < m — e, the above cannot be less than r. Similar for A; = 1, m > 1, above 
cannot be < r. For fc > 1 we can lower boimd ~ ^ ~ ^ by (by choosing e < 1). Thus, 

min(m,n) / min(m,n) 

e 



E E (2i-l + |n-m|)a,-a, 



> (m - fc) f 1 + fc' 

V mn/ 

> (m-fc)(l + fc) 

= m — fc + mfc — fc^ 
= m + (m — 1 — 

> m (67) 



Hence, 



n (logdet(/ + P{Hi)HH^) < i?) 

= / SNR'"f("-'"+'")n™iSNR-(2'-^+"-'"'"*n™iSNR-(2'-^+"-"'"*rfarfa 

JAnE^ 

= [ SNR""Pn™iSNR-(2*-i+»-'")3.n^iSNR-(2*-i+"-'»)«'dada 
JAnEm 

Hence, the negative of the SNR exponent of the above probability is 



(68) 



min y^(2i — 1 + n — m)(ai + aA ~ mnp. (69) 

(a,a)eAnE,n ^ 
1=1 

The optimal a, = p. Now, let ai = p + (3i. The above negative SNR exponent is given by 

m 

mnp + min >^(2i — 1 + n — m)di (70) 

(/3i,-,/3™)eB^ 

z— 1 

where S = {(/?i, • • • ,/3m) : A > 0, + (w^ — Pi)^ < the definition of function G, the above reduces 

to 

mn» + G(r, 1 + (mn — 1)» ). (71) 

mn 
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For e ^ 0, the probabUity that the channel cannot support rate R is thus SNR-^^p-cCr.i+Cmn-ijp- 



Appendix F 
Proof of Theorem 5 

To verify that the power constraint is satisfied, observe the following. 

1) Round 1: Receiver trains the transmitter with power SNR, and hence power constraint is satisfied. 

2) Round 2: The transmitter trains the receiver with a power level PiiuA = „^ — _(2^i-i+|„-„|)ai ■ ^ and the average 
power constraint is satisfied as was shown in single round. 

n = 1) = n (det (/ + GzG^) > SNR''+'^) (72) 
< SNR-^=(''+^\ (73) 

where the second steps follows from Lemma 1 with p = 1. 

3) Round u = ^:K-l: 

n(g2 = i) = n($2 = i,'72 = i) + n(§2-i,(Z2 = o) (74) 

< n(g2 = l) + n(g2 = l|g2 = 0) (75) 

< SNR-^^('-+^)+e-s^'R' (76) 
= SNR-^^(''+^) (77) 

Further, the training power from the transmitter satisfies the power constraint by the same arguments as in the single 
round, 

n(g'« = l) = n(g'„ = l,g;j_i = l.g'u.i = 1) +n(g'„ = l,g;j_i = O.g'u-i = 1) . (78) 

Let us first consider the first term. As = 1 is received as qu-i = 1, ctm < 1 + Wu-i {r + e) — Since a^n 
is less than the SNR exponent of the power at which the training symbol was sent, G„ G with probability = by 
Lemma 1. Also, the power estimation cannot fail since = Oim- Thus, the first term is = 0. 

The second term is upper bounded by 11 = Q\qu-i = 1) whichhappens with probabihty SNR~'""(^+^''-i(''+^^^+' = 

SNR-^"('-+^). Hence, 



n(9„ = l) = n(g„ = = l,^^.! = 1) + n(g'„ = = O,^^.! = 1) (79) 

< SNR-^''(''+^\ (80) 

The power constraint at the transmitter is satisfied since the average power is bounded by SNR. 
4) Round K (forward only): The transmitter trains the receiver and then sends the data both at a power level of 

|g|\jpmax(l+X;™*"*''*'(2i-l+|n-m|)ai,i,max„g[2,s:_i],j„=i (l+5Z™?'"''"''(2«-l+|n-'n|)S„,j)) j-g]^-) 

Note that the power constraint is satisfied since for any maximizing term in the power, the probabihty that that particular 
q=l would balance the exponent. 
We will now show that the outage probability is = mn (1 + Wk-i (r + e)). For JsT = 2, it follows directly by Lemma 1 for 
p=l. We now assume that K > 2. For this, we see the region of H when there will be outage. 

1) det (/ + HH'fPi (^i)J > SNR'^. In this case, power of P (^Hi^ is sufficient, while we will be using a higher power 
level. Hence, there can be no outage. 

2) dot (^I + HH^Pi (^i)) < SNR'' and Q!,„ < 1 + 14^2 + e) - In this case, q2 = I with probability 1. The reason is 

that n _ < Y^mm(m,n) ^ _ > ^ _|_ = Q (where ai and a-i are the negative exponents of 

the eigenvalues of HH'^ and H2H\, where H2 is the non-power controlled channel estimate at the receiver and p is the 
SNR exponent m P [Hi]). Further, 92 = 1 because < 1 + W2 {r + — makes the received power >SNR'^/™". 



As $2 = 1, using power P \ H2j does not produce any outage when < 1 + W2 (r + e) — 

3) For u G [3, if - 1], det (/ + HH^P^ (^H^^^ < SNR\ 1 + Wu-i {r + e) - ^ < < 1 + W^-i (r + e) - ^. As 
before, q2 = 1. Now, since am > 1 + W2 (r + e) — the transmitter will receive this index as which the receiver 
would know. Hence, qs = 1. This continues till qu = 1. However, qu = ^ since a^n < i + Wu-i (r + e) — Moreover, 
sending at power P„ (^'') when am < 1 + Wu-i ('^ + ~ ^ does not produce any outage. 
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Hence, the outage happens only when det (/ + HH^Pt (#1 j j < SNR'" and > 1 + W„_i ('^ + e) - whose probability 

is less than the probability of > 1 + Wk-i {"t + — which happens with probability SNR"^^*''"'*''^-'. For e very small, 
the diversity multiplexing tradeoff as in the statement of the Theorem can be achieved. 
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